AITopics | Luxor Governorate

Luxor Governorate

TeaserGen: Generating Teasers for Long Documentaries

Xu, Weihan, Liang, Paul Pu, Kim, Haven, McAuley, Julian, Berg-Kirkpatrick, Taylor, Dong, Hao-Wen

arXiv.org Artificial Intelligence | Nov-9-2024

Teasers are an effective tool for promoting content in entertainment, commercial and educational fields. However, creating an effective teaser for a long video is challenging because it requires long-range multimodal modeling of the input video while maintaining audiovisual alignment, managing scene changes and preserving factual accuracy in the output teaser. Progress in this research direction has been hindered by the lack of a publicly available dataset. In this work, we present DocumentaryNet, a collection of 1,269 documentaries paired with their teasers, featuring multimodal data streams of video, speech, music, sound effects and narrations. With DocumentaryNet, we propose a new two-stage system for generating teasers from long documentaries. The proposed TeaserGen system first generates the teaser narration from the transcribed narration of the documentary using a pretrained large language model, and then selects the most relevant visual content to accompany the generated narration through language-vision models. For narration-video matching, we explore two approaches: a pretraining-based model using pretrained contrastive language-vision models and a deep sequential model that learns the mapping between the narrations and visuals. Our experimental results show that the pretraining-based approach is more effective at identifying relevant visual content than directly trained deep autoregressive models.
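The narration-video matching step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes narration sentences and video clips have already been embedded into a shared space (for example by a contrastive language-vision model), uses toy vectors in place of real embeddings, and picks clips with a simple greedy rule. The function names `cosine` and `match_narration_to_clips` are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def match_narration_to_clips(sentence_embs, clip_embs):
    """Greedily assign each narration sentence its most similar unused clip.

    Returns a list of clip indices, one per sentence (None if no clip is left).
    The greedy, no-reuse rule is an assumption made for this illustration.
    """
    used = set()
    chosen = []
    for s in sentence_embs:
        best_idx, best_score = None, -math.inf
        for i, c in enumerate(clip_embs):
            if i in used:
                continue
            score = cosine(s, c)
            if score > best_score:
                best_idx, best_score = i, score
        if best_idx is not None:
            used.add(best_idx)
        chosen.append(best_idx)
    return chosen


# Toy embeddings standing in for real language-vision model outputs.
sentences = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]
clips = [[0.1, 0.9, 0.0], [0.9, 0.1, 0.0], [0.0, 0.0, 1.0]]
selection = match_narration_to_clips(sentences, clips)
print(selection)
```

With real embeddings, the same similarity scores could feed a different selection rule, such as one that also penalizes abrupt scene changes. The greedy rule here is only the simplest choice.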

large language model, machine learning, natural language, (20 more...)

arXiv:2410.05586

Country:

Asia > Afghanistan > Kabul Province > Kabul (0.05)
Africa > Middle East > Egypt > Aswan Governorate > Aswan (0.05)
North America > United States > New York > New York County > New York City (0.04)
(6 more...)

Genre: Research Report > New Finding (0.34)

Industry:

Leisure & Entertainment (1.00)
Government > Military (0.69)
Media > Film (0.47)
Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)

Foundation Models for Natural Language Processing -- Pre-trained Language Models Integrating Media

Paaß, Gerhard, Giesselbach, Sven

arXiv.org Artificial Intelligence | Feb-16-2023

This open access book provides a comprehensive overview of the state of the art in research and applications of Foundation Models and is intended for readers familiar with basic Natural Language Processing (NLP) concepts. In recent years, a revolutionary new paradigm has been developed for training models for NLP. These models are first pre-trained on large collections of text documents to acquire general syntactic knowledge and semantic information. Then, they are fine-tuned for specific tasks, which they can often solve with superhuman accuracy. When the models are large enough, they can be instructed by prompts to solve new tasks without any fine-tuning. Moreover, they can be applied to a wide range of different media and problem domains, ranging from image and video processing to robot control learning. Because they provide a blueprint for solving many tasks in artificial intelligence, they have been called Foundation Models. After a brief introduction to basic NLP models, the main pre-trained language models BERT, GPT and the sequence-to-sequence transformer are described, as well as the concepts of self-attention and context-sensitive embedding. Then, different approaches to improving these models are discussed, such as expanding the pre-training criteria, increasing the length of input texts, or including extra knowledge. An overview of the best-performing models for about twenty application areas is then presented, e.g., question answering, translation, story generation, dialog systems and generating images from text. For each application area, the strengths and weaknesses of current models are discussed, and an outlook on further developments is given. In addition, links are provided to freely available program code. A concluding chapter summarizes the economic opportunities, mitigation of risks, and potential developments of AI.

large language model, machine learning, pattern recognition, (32 more...)

arXiv:2302.08575

Country:

Europe > Ukraine > Kyiv Oblast > Kyiv (0.13)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.13)
North America > Canada > Ontario > Toronto (0.13)
(43 more...)

Genre:

Workflow (1.00)
Summary/Review (1.00)
Research Report > Promising Solution (1.00)
(4 more...)

Industry:

Transportation > Passenger (1.00)
Media > Television (1.00)
Media > News (1.00)
(21 more...)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
(23 more...)

Macro Room create video using ink and water

Daily Mail - Science & tech | May-5-2017, 19:40:50 GMT

A hypnotic new video reveals the unearthly beauty of life up-close. Using different colored ink in water, the team at the Macro Room has created a breathtaking short film that could rival the effects of CGI. Ink In Motion, shared on YouTube by the Macro Room, gives a close-up look at 'the hypnotising beauty of colored ink in water and the interaction of this substance with different elements.' It begins with just a tank filled with water, and 3D planet models submerged in the center.

artificial intelligence, ink, machine learning, (16 more...)

Country: Africa > Middle East > Egypt > Luxor Governorate > Luxor (0.05)

Industry:

Media > Film (0.73)
Leisure & Entertainment (0.73)

Technology:

Information Technology > Artificial Intelligence > Vision (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)

Map shows parts of the US most at risk of a robot takeover

Daily Mail - Science & tech | May-5-2017, 19:40:12 GMT

Researchers have warned that millions of human workers in the US will be replaced by robots over the next few decades, leaving Americans to wonder what areas are at the highest risk. Now, a new map has shown where the most 'automatable' jobs are in the nation - and almost every metropolitan area is set to experience a robot takeover. However, it is the low-wage cities like Las Vegas, Nevada, El Paso, Texas and San Bernardino, California that will be hit the hardest – robots are predicted to take more than 60% of jobs in these cities by 2035. The bubble size shows the number of workers employed in the metropolitan areas in December 2016.

artificial intelligence, metropolitan area, robot, (14 more...)

Country:

North America > United States > Texas > El Paso County > El Paso (0.25)
North America > United States > Nevada > Clark County > Las Vegas (0.25)
North America > United States > California > San Bernardino County > San Bernardino (0.25)
(5 more...)

Industry: Government > Regional Government > North America Government > United States Government (0.72)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)

Prominent al-Qaida figure killed in US drone strike in Syria

U.S. News | Apr-8-2016, 14:55:32 GMT

A senior Egyptian al-Qaida figure fighting in Syria was killed in a U.S. drone strike this week, the latest to be killed in such attacks in Syria, a Syrian opposition monitoring group and relatives said Friday. The Britain-based Syrian Observatory for Human Rights said Rifai Ahmad Taha was killed in a strike Tuesday in the northwestern Idlib province. Before joining al-Qaida, Taha was a top figure in Egypt's notorious militant group Gamaa Islamiya, which massacred 58 foreign tourists in the ancient Egyptian city of Luxor in 1997. He was also allied with Osama bin Laden in Afghanistan. The Observatory's chief Rami Abdurrahman said several al-Qaida members, including Taha, were killed in Tuesday's strike.

artificial intelligence, syria, taha, (12 more...)

Country:

Asia > Middle East > Syria > Idlib Governorate > Idlib (0.27)
Asia > Afghanistan (0.27)
Europe > United Kingdom (0.26)
(9 more...)

Industry:

Government > Military (1.00)
Government > Regional Government > Asia Government > Middle East Government > Syria Government (0.37)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.93)
